SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: Murphy, Brian R. 

Collins, Peter L. 
Whitehead, Stephen S. 
Bukreyev, Alexander A. 
Juhasz, Katalin

(ii) TITLE OF INVENTION: PRODUCTION OF ATTENUATED RESPIRATORY
SYNCYTIAL VIRUS VACCINES FROM CLONED NUCLEOTIDE SEQUENCES

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Townsend and Townsend and Crew LLP 

(B) STREET: Two Embarcadero Center, 8th Floor 

(C) CITY: San Francisco 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94111-3834 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 15-JUL-1997 

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/047,634 

(B) FILING DATE: 23-MAY-1997

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/046,141

(B) FILING DATE: 09-MAY-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/021,773 

(B) FILING DATE: 15-JUL-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Parmelee, Steven W. 

(B) REGISTRATION NUMBER: 31,990 

(C) REFERENCE/DOCKET NUMBER: 17634-000510

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 206-467-9600 

(B) TELEFAX: 415-576-0300 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ACGCGAAAAA ATGCGTACAA CAAACTTGCA TAAACCAAAA AAATGGGGCA AATAAGAATT 60 

TGATAAGTAC CACTTAAATT TAACTCCCTT GGTTAGAGAT GGGCAGCAAT TCATTGAGTA 120

TGATAAAAGT TAGATTACAA AATTTGTTTG ACAATGATGA AGTAGCATTG TTAAAAATAA 180 

CATGCTATAC TGATAAATTA ATACATTTAA CTAATGCTTT GGCTAAGGCA GTGATACATA 240 

CAATCAAATT GAATGGCATT GTGTTTGTGC ATGTTATTAC AAGTAGTGAT ATTTGCCCTA 300 

ATAATAATAT TGTAGTAAAA TCCAATTTCA CAACAATGCC AGTACTACAA AATGGAGGTT 360 

ATATATGGGA AATGATGGAA TTAACACATT GCTCTCAACC TAATGGTCTA CTAGATGACA 420 

ATTGTGAAAT TAAATTCTCC AAAAAACTAA GTGATTCAAC AATGACCAAT TATATGAATC 480 

AATTATCTGA ATTACTTGGA TTTGATCTTA ATCCATAAAT TATAATTAAT ATCAACTAGC 540 

AAATCAATGT CACTAACACC ATTAGTTAAT ATAT^CTTA ACAGAAGACA AAAATGGGGC 600 

AAATAAATCA ATTCAGCCAA CCCAACCATG GACACAACCC ACAATGATAA TACACCACAA 660 

AGACTGATGA TCACAGACAT GAGACCGTTG TCACTTGAGA CCATAATAAC ATCACTAACC 720

AGAGACATCA TAACACACAA ATTTATATAC TTGATAAATC ATGAATGCAT AGTGAGAAAA 780

CTTGATGAAA AGCAGGCCAC ATTTACATTC CTGGTCAACT ATGAAATGAA ACTATTACAC 840

AAAGTAGGAA GCACTAAATA TAAAAAATAT ACTGAATACA ACACAAAATA TGGCACTTTC 900

CCTATGCCAA TATTCATCAA TCATGATGGG TTCTTAGAAT GCATTGGCAT TAAGCCTACA 960

AAGCATACTC CCATAATATA CAAGTATGAT CTCAATCCAT AAATTTCAAC ACAATATTCA 1020

CACAATCTAA AACAACAACT CTATGCATAA CTATACTCCA TAGTCCAGAT GGAGCCTGAA 1080

AATTATAGTA ATTTAAAACT TAAGGAGAGA TATAAGATAG AAGATGGGGC AAATACAACC 1140

ATGGCTCTTA GCAAAGTCAA GTTGAATGAT ACACTCAACA AAGATCAACT TCTGTCATCC 1200

AGCAAATACA CCATCCAACG GAGCACAGGA GATAGTATTG ATACTCCTAA TTATGATGTG 1260 

CAGAAACACA TCAATAAGTT ATGTGGCATG TTATTAATCA CAGAAGATGC TAATCATAAA 1320

TTCACTGGGT TAATAGGTAT GTTATATGCG ATGTCTAGGT TAGGAAGAGA AGACACCATA 1380 

AAAATACTCA GAGATGCGGG ATATCATGTA AAAGCAAATG GAGTAGATGT AACAACACAT 1440 

CGTCAAGACA TTAATGGAAA AGAAATGAAA TTTGAAGTGT TAACATTGGC AAGCTTAACA 1500 

ACTGAAATTC AAATCAACAT TGAGATAGAA TCTAGAAAAT CCTACAAAAA AATGCTAAAA 1560 

GAAATGGGAG AGGTAGCTCC AGAATACAGG CATGACTCTC CTGATTGTGG GATGATAATA 1620 

TTATGTATAG CAGCATTAGT AATAACTAAA TTAGCAGCAG GGGACAGATC TGGTCTTACA 1680 

GCCGTGATTA GGAGAGCTAA TAATGTCCTA AAAAATGAAA TGAAACGTTA CAAAGGCTTA 1740 

CTACCCAAGG ACATAGCCAA CAGCTTCTAT GAAGTGTTTG AAAAACATCC CCACTTTATA 1800 

GATGTTTTTG TTCATTTTGG TATAGCACAA TCTTCTACCA GAGGTGGCAG TAGAGTTGAA 1860 

GGGATTTTTG CAGGATTGTT TATGAATGCC TATGGTGCAG GGCAAGTGAT GTTACGGTGG 1920 

GGAGTCTTAG CAAAATCAGT TAAAAATATT ATGTTAGGAC ATGCTAGTGT GCAAGCAGAA 1980 

ATGGAACAAG TTGTTGAGGT TTATGAATAT GCCCAAAAAT TGGGTGGTGA AGCAGGATTC 2040 
TACCATATAT TGAACAACCC AAAAGCATCA TTATTATCTT TGACTCAATT TCCTCACTTC 2100

TCCAGTGTAG TATTAGGCAA TGCTGCTGGC CTAGGCATAA TGGGAGAGTA CAGAGGTACA 2160

CCGAGGAATC AAGATCTATA TGATGCAGCA AAGGCATATG CTGAACAACT CAAAGAAAAT 2220 

GGTGTGATTA ACTACAGTGT ACTAGACTTG ACAGCAGAAG AACTAGAGGC TATCAAACAT 2280 

CAGCTTAATC CAAAAGATAA TGATGTAGAG CTTTGAGTTA ATAAAAAATG GGGCAAATAA 2340 

ATCATCATGG AAAAGTTTGC TCCTGAATTC CATGGAGAAG ATGCAAACT^ CAGGGCTACT 2400 

AAATTCCTAG T^TCAATAAA GGGCAAATTC ACATCACCCA AAGATCCCAA GAAAAAAGAT 2460 

AGTATCATAT CTGTCAACTC AATAGATATA GAAGTAACCA T^GAAAGCCC TATAACATCA 2520 

AATTCAACTA TTATCAACCC AACAAATGAG ACAGATGATA CTGCAGGGAA CTU^GCCCAAT 2580 

TATCAAAGAA AACCTCTAGT AAGTTTCAAA GAAGACCCTA CACCAAGTGA TAATCCCTTT 2640 

TCTAAACTAT ACAAAGAAAC CATAGAAACA TTTGATAACA ATGAAGAAGA ATCCAGCTAT 2700 

TCATACGAAG AAATAAATGA TCAGACAAAC GATAATATAA CAGCAAGATT AGATAGGATT 2760 

GATGAAAAAT TAAGTGAAAT ACTAGGAATG CTTCACACAT TAGTAGTGGC AAGTGCAGGA 2820 

CCTACATCTG CTCGGGATGG TATAAGAGAT GCCATGGTTG GTTTAAGAGA AGAAATGATA 2880 

GAAAAAATCA GAACTGAAGC ATTAATGACC AATGACAGAT TAGAAGCTAT GGCAAGACTC 2940 

AGGAATGAGG AAAGTGAAAA GATGGCAAAA GACACATCAG ATGAAGTGTC TCTCAATCCA 3000

ACATCAGAGA AATTGAACAA CCTATTGGAA GGGAATGATA GTGACAATGA TCTATCACTT 3060

GAAGATTTCT GATTAGTTAC CAATCTTCAC ATCAACACAC AATACCAACA GAAGACCAAC 3120

AAACTAACCA ACCCAATCAT CCAACCAAAC ATCCATCCGC CAATCAGCCA AACAGCCAAC 3180

AAAACAACCA GCCAATCCAA AACTAACCAC CCGGAAAAAA TCTATAATAT AGTTACAAAA 3240

AAAGGT^AAGG GTGGGGCAAA TATGGAAACA TACGTGAACA AGCTTCACGA AGGCTCCACA 3300

TACACAGCTG CTGTTCAATA CAATGTCTTA GAAAAAGACG ATGACCCTGC ATCACTTACA 3360 

ATATGGGTGC CCATGTTCCA ATCATCTATG CCAGCAGATT TACTTATAAA AGAACTAGCT 3420 

AATGTCAACA TACTAGTGAA ACAAATATCC ACACCCAAGG GACCTTCACT AAGAGTCATG 3480 

ATAAACTCAA GAAGTGCAGT GCTAGCACAA ATGCCCAGCA AATTTACCAT ATGCGCTAAT 3540 

GTGTCCTTGG ATGAAAGAAG CAAACTAGCA TATGATGTAA CCACACCCTG TGAAATCAAG 3600 

GCATGTAGTC TAACATGCCT T^AAATCAAAA AATATGTTGA CTACAGTTAA AGATCTCACT 3660 

ATGAAGACAC TCAACCCTAC ACATGATATT ATTGCTTTAT GTGAATTTGA T^AACATAGTA 3720

AC7VTCAAAAA AAGTCATAAT ACCAACATAC CTAAGATCCA TCAGTGTCAG AAATAAAGAT 3780 

CTGAACACAC TTGAAAATAT AACAACCACT GAATTCAAAA ATGCTATCAC AAATGCAAAA 3840 

ATCATCCCTT ACTCAGGATT ACTATTAGTC ATCACAGTGA CTGACAACAA AGGAGCATTC 3900 

AAATACATAA AGCCACAAAG TCAATTCATA GTAGATCTTG GAGCTTACCT AGAAAAAGAA 3960

AGTATATATT ATGTTACCAC AAATTGGAAG CACACAGCTA CACGATTTGC AATCAAACCC 4020

ATGGAAGATT AACCTTTTTC CTCTACATCA GTGTGTTAAT TCATACAAAC TTTCTACCTA 4080 

CATTCTTCAC TTCACCATCA CAATCACAAA CACTCTGTGG TTCAACCAAT CAAACAAAAC 4140 
TTATCTGAAG TCCCAGATCA TCCCAAGTCA TTGTTTATCA GATCTAGTAC TCAAATAAGT 4200

TAATAAAAAA TATACACATG GGGCAAATAA TCATTGGAGG AAATCCAACT AATCACAATA 4260 

TCTGTTAACA TAGACAAGTC CACACACCAT ACAGAATCAA CCAATGGAAA ATACATCCAT 4320

AACAATAGAA TTCTCAAGCA AATTCTGGCC TTACTTTACA CTAATACACA TGATCACAAC 4380 

AATAATCTCT TTGCTAATCA TAATCTCCAT CATGATTGCA ATACTAAACA AACTTTGTGA 4440 

ATATAACGTA TTCCATAACA AAACCTTTGA GTTACCAAGA GCTCGAGTCA ACACATAGCA 4500 

TTCATCAATC CAACAGCCCA AAACAGTAAC CTTGCATTTA AAAATGAACA ACCCCTACCT 4560 

CTTTACAACA CCTCATTAAC ATCCCACCAT GCAAACCACT ATCCATACTA TAAAGTAGTT 4620

AATTAAAAAT AGTCATAACA ATGAACTAGG ATATCAAGAC TAACAATAAC ATTGGGGCAA 4680 

ATGCAAACAT GTCCAAAAAC AAGGACCAAC GCACCGCTAA GACATTAGAA AGGACCTGGG 4740 

ACACTCTCAA TCATTTATTA TTCATATCAT CGTGCTTATA TAAGTTAAAT CTTAAATCTG 4800 

TAGCACAAAT CACATTATCC ATTCTGGCAA TGATAATCTC AACTTCACTT ATAATTGCAG 4860 

CCATCATATT CATAGCCTCG GCAAACCACA AAGTCACACC AACAACTGCA ATCATACAAG 4920

ATGCAACAAG CCAGATCAAG AACACAACCC CAACATACCT CACCCAGAAT CCTCAGCTTG 4980

GAATCAGTCC CTCTAATCCG TCTGAAATTA CATCACAAAT CACCACCATA CTAGCTTCAA 5040 

CAACACCAGG AGTCAAGTCA ACCCTGCAAT CCACAACAGT CAAGACCAAA AACACAACAA 5100 

CAACTCAAAC ACAACCCAGC AAGCCCACCA CAAAACAACG CCAAAACAAA CCACCAAGCA 5160 

AACCCAATAA TGATTTTCAC TTTGAAGTGT TCAACTTTGT ACCCTGCAGC ATATGCAGCA 5220 

ACAATCCAAC CTGCTGGGCT ATCTGCAAAA GAATACCAAA CAAAAAACCA GGAAAGAAAA 5280 

CCACTACCAA GCCCACAAAA AAACCAACCC TCAAGACAAC CAAAAAAGAT CCCAAACCTC 5340 

AAACCACTAA ATCAAAGGAA GTACCCACCA CCAAGCCCAC AGAAGAGCCA ACCATCAACA 5400 

CCACCAAAAC AAACATCATA ACTACACTAC TCACCTCCAA CACCACAGGA AATCCAGAAC 5460 

TCACAAGTCA AATGGAAACC TTCCACTCAA CTTCCTCCGA AGGCAATCCA AGCCCTTCTC 5520 

AAGTCTCTAC AACATCCGAG TACCCATCAC AACCTTCATC TCCACCCAAC ACACCACGCC 5580 

AGTAGTTACT TAAAAACATA TTATCACAAA AGGCCTTGAC CAACTTAAAC AGAATCAAAA 5640 

TAAACTCTGG GGCAAATAAC AATGGAGTTG CTAATCCTCA AAGCAAATGC AATTACCACA 5700 

ATCCTCACTG CAGTCACATT TTGTTTTGCT TCTGGTCAAA ACATCACTGA AGAATTTTAT 5760 

CAATCAACAT GCAGTGCAGT TAGCAAAGGC TATCTTAGTG CTCTGAGAAC TGGTTGGTAT 5820

ACCAGTGTTA TAACTATAGA ATTAAGTAAT ATCAAGAAAA ATAAGTGTAA TGGAACAGAT 5880 

GCTAAGGTAA AATTGATAAA ACAAGAATTA GATAAATATA AAAATGCTGT AACAGAATTG 5940 

CAGTTGCTCA TGCAAAGCAC ACAAGCAACA AACAATCGAG CCAGAAGAGA ACTACCAAGG 6000 

TTTATGAATT ATACACTCAA CAATGCCAAA AAAACCAATG TAACATTAAG CAAGAAAAGG 6060 

AAAAGAAGAT TTCTTGGTTT TTTGTTAGGT GTTGGATCTG CAATCGCCAG TGGCGTTGCT 6120 

GTATCTAAGG TCCTGCACCT AGAAGGGGAA GTGAACAAGA TCAAAAGTGC TCTACTATCC 6180 

ACAAACAAGG CTGTAGTCAG CTTATCAAAT GGAGTTAGTG TTTTAACCAG CAAAGTGTTA 6240 
GACCTCAAAA ACTATATAGA TAAACAATTG TTACCTATTG TGAACAAGCA AAGCTGCAGC 6300

ATATCAAATA TAGAAACTGT GATAGAGTTC CAACAAAAGA ACAACAGACT ACTAGAGATT 6360

ACCAGGGAAT TTAGTGTTAA TGCAGGCGTA ACTACACCTG TAAGCACTTA CATGTTAACT 6420

AATAGTGAAT TATTGTCATT AATCAATGAT ATGCCTATAA CAAATGATCA GAAAAAGTTA 6480

ATGTCCAACA ATGTTCAAAT AGTTAGACAG CAAAGTTACT CTATCATGTC CATAATAAAA 6540

GAGGAAGTCT TAGCATATGT AGTACAATTA CCACTATATG GTGTTATAGA TACACCCTGT 6600

TGGAAACTAC ACACATCCCC TCTATGTACA ACCAACACAA AAGAAGGGTC CAACATCTGT 6660

TTAACAAGAA CTGACAGAGG ATGGTACTGT GACAATGCAG GATCAGTATC TTTCTTCCCA 6720

CAAGCTGAAA CATGTAAAGT TCT^TCAAAT CGAGTATTTT GTGACACAAT GAACAGTTTA 6780

ACATTACCAA GTGAAGTAAA TCTCTGCAAT GTTGACATAT TCAACCCCAA ATATGATTGT 6840

AAAATTATGA CTTCAAAAAC AGATGTAAGC AGCTCCGTTA TCACATCTCT AGGAGCCATT 6900

GTGTCATGCT ATGGCAAAAC TAAATGTACA GCATCCAATA AAAATCGTGG AATCATAAAG 6960

ACATTTTCTA ACGGGTGCGA TTATGTATCA AATAAAGGGG TGGACACTGT GTCTGTAGGT 7020

AACACATTAT ATTATGTAAA TAAGCAAGAA GGTAAAAGTC TCTATGTAAA AGGTGAACCA 7080

ATAATAAATT TCTATGACCC ATTAGTATTC CCCTCTGATG AATTTGATGC ATCAATATCT 7140

CAAGTCAACG AGAAGATTAA CCAGAGCCTA GCATTTATTC GTAAATCCGA TGAATTATTA 7200

CATAATGTAA ATGCTGGTAA ATCCACCACA AATATCATGA TAACTACTAT AATTATAGTG 7260

ATTATAGTAA TATTGTTATC ATTAATTGCT GTTGGACTGC TCTTATACTG TAAGGCCAGA 7320

AGCACACCAG TCACACTAAG CAAAGATCAA CTGAGTGGTA TAAATAATAT TGCATTTAGT 7380

AACTAAATAA AAATAGCACC TAATCATGTT CTTACAATGG TTTACTATCT GCTCATAGAC 7440

AACCCATCTG TCATTGGATT TTCTTAAAAT CTGAACTTCA TCGAAACTCT CATCTATAAA 7500

CCATCTCACT TACACTATTT AAGTAGATTC CTAGTTTATA GTTATATAAA ACACAATTGC 7560

ATGCCAGATT AACTTACCAT CTGTAAAAAT GAAAACTGGG GCAAATATGT CACGAAGGAA 7620

TCCTTGCAAA TTTGAAATTC GAGGTCATTG CTTAAATGGT AAGAGGTGTC ATTTTAGTCA 7680

TAATTATTTT GAATGGCCAC CCCATGCACT GCTTGTAAGA CAAAACTTTA TGTTAAACAG 7740

AATACTTAAG TCTATGGATA AAAGTATAGA TACCTTATCA GAAATAAGTG GAGCTGCAGA 7800

GTTGGACAGA ACAGAAGAGT ATGCTCTTGG TGTAGTTGGA GTGCTAGAGA GTTATATAGG 7860

ATCAATAAAC AATATAACTA AACAATCAGC ATGTGTTGCC ATGAGCAAAC TCCTCACTGA 7920

ACTCAATAGT GATGATATCA AAAAGCTGAG GGACAATGAA GAGCTAAATT CACCCAAGAT 7980

AAGAGTGTAC AATACTGTCA TATCATATAT TGAAAGCAAC AGGAAAAACA ATAAACAAAC 8040

TATCCATCTG TTAT^AAAGAT TGCCAGCAGA CGTATTGAAG AAAACCATCA AAAACACATT 8100

GGATATCCAT AAGAGCATAA CCATCAACAA CCCAAAAGAA TCAACTGTTA GTGATACAAA 8160

TGACCATGCC AAAAATAATG ATACTACCTG ACAAATATCC TTGTAGTATA ACTTCCATAC 8220

TAATAACAAG TAGATGTAGA GTTACTATGT ATAATCAAAA GAACACACTA TATTTCAATC 8280

AAAACAACCC AAATAACCAT ATGTACTCAC CGAATCAAAC ATTCAATGAA ATCCATTGGA 8340




CCTCTCAAGA ATTGATTGAC ACAATTCAAA ATTTTCTACA ACATCTAGGT ATTATTGAGG 8400

ATATATATAC AATATATATA TTAGTGTCAT AACACTCAAT TCTAACACTC ACCACATCGT 8460

TACATTATTA ATTCAAACAA TTCAAGTTGT GGGACAAAAT GGATCCCATT ATTAATGGAA 8520

ATTCTGCTAA TGTTTATCTA ACCGATAGTT ATTTAAAAGG TGTTATCTCT TTCTCAGAGT 8580

GTAATGCTTT AGGAAGTTAC ATATTCAATG GTCCTTATCT CAAAAATGAT TATACCAACT 8640

TAATTAGTAG ACAAAATCCA TTAATAGAAC ACATGAATCT AAAGAAACTA AATATAACAC 8700

AGTCCTTAAT ATCTAAGTAT CATAAAGGTG AAATAAAATT AGAAGAACCT ACTTATTTTC 8760

AGTCATTACT TATGACATAC AAGAGTATGA CCTCGTCAGA ACAGATTGCT ACCACTAATT 8820

TACTTAAAAA GATAATAAGA AGAGCTATAG AAATAAGTGA TGTCAAAGTC TATGCTATAT 8880

TGAATAAACT AGGGCTTAAA GAAAAGGACA AGATTAAATC CAACAATGGA CAAGATGAAG 8940

ACAACTCAGT TATTACGACC ATAATCAAAG ATGATATACT TTCAGCTGTT AAAGATAATC 9000

AATCTCATCT TAAAGCAGAC AAAAATCACT CTACAAAACA AAAAGACACA ATCAAAACAA 9060

CACTCTTGAA GAAATTGATG TGTTCAATGC AACATCCTCC ATCATGGTTA ATACATTGGT 9120

TTAACTTATA CACAAAATTA AACAACATAT TAACACAGTA TCGATCAAAT GAGGTAAAAA 9180

ACCATGGGTT TACATTGATA GATAATCAAA CTCTTAGTGG ATTTCAATTT ATTTTGAACC 9240

AATATGGTTG TATAGTTTAT CATAAGGAAC TCAAAAGAAT TACTGTGACA ACCTATAATC 9300

AATTCTTGAC ATGGAAAGAT ATTAGCCTTA GTAGATTAAA TGTTTGTTTA ATTACATGGA 9360

TTAGTAACTG CTTGAACACA TTAAATTU^AA GCTTAGGCTT AAGATGCGGA TTCAATAATG 9420

TTATCTTGAC ACAACTATTC CTTTATGGAG ATTGTATACT AAAGCTATTT CACAATGAGG 9480

GGTTCTACAT AATAAAAGAG GTAGAGGGAT TTATTATGTC TCTAATTTTA AATATAACAG 9540

AAGAAGATCA ATTCAGAAAA CGATTTTATA ATAGTATGCT CAACAACATC ACAGATGCTG 9600

CTAATAAAGC TC7VGAAAAAT CTGCTATCAA GAGTATGTCA TACATTATTA GATAAGACAG 9660

TGTCCGATAA TATAATAAAT GGCAGATGGA TAATTCTATT AAGTAAGTTC CTTAAATTT^ 9720

TTAAGCTTGC AGGTGACAAT AACCTTAACA ATCTGAGTGA ACTATATTTT TTGTTCAGAA 9780

TATTTGGACA CCCAATGGTA GATGAAAGAC AAGCCATGGA TGCTGTTAAA ATTAATTGCA 9840

ATGAGACCAA ATTTTACTTG TTAAGCAGTC TGAGTATGTT AAGAGGTGCC TTTATATATA 9900

GAATTATAAA AGGGTTTGTA AATAATTACA ACAGATGGCC TACTTTAAGA AATGCTATTG 9960

TTTTACCCTT AAGATGGTTA ACTTACTATA AACTAAACAC TTATCCTTCT TTGTTGGAAC 10020

TTACAGAAAG AGATTTGATT GTGTTATCAG GACTACGTTT CTATCGTGAG TTTCGGTTGC 10080

CTAAAAAAGT GGATCTTGAA ATGATTATAA ATGATAAAGC TATATCACCT CCTAAAAATT 10140

TGATATGGAC TAGTTTCCCT AGAAATTACA TGCCATCACA CATACAAAAC TATATAGAAC 10200

ATGAAAAATT AAAATTTTCC GAGAGTGATA AATC7VAGAAG AGTATTAGAG TATTATTTAA 10260

GAGATAACAA ATTCAATGAA TGTGATTTAT ACAACTGTGT AGTTAATCAA AGTTATCTCA 10320

ACAACCCTAA TCATGTGGTA TCATTGACAG GCAAAGAAAG AGAACTCAGT GTAGGTAGAA 10380

TGTTTGCAAT GCAACCGGGA ATGTTCAGAC AGGTTCAAAT ATTGGCAGAG AAAATGATAG 10440

CTGAAAACAT TTTACAATTC TTTCCTGJU^ GTCTTACAAG ATATGGTGAT CTAGAACTAC 10500

AAAAAATATT AGAACTGAAA GCAGGAATAA GTAACAAATC AAATCGCTAC AATGATAATT 10560

ACAACAATTA CATTAGTAAG TGCTCTATCA TCACAGATCT CAGCAAATTC AATCAAGCAT 10620

TTCGATATGA AACGTCATGT ATTTGTAGTG ATGTGCTGGA TGAACTGCAT GGTGTACAAT 10680

CTCTATTTTC CTGGTTACAT TTAACTATTC CTCATGTCAC AATAATATGC ACATATAGGC 10740

ATGCACCCCC CTATATAGGA GATCATATTG TAGATCTTAA CAATGTAGAT GAACAAAGTG 10800

GATTATATAG ATATCACATG GGTGGCATCG AAGGGTGGTG TCAAAAACTA TGGACCATAG 10860

AAGCTATATC ACTATTGGAT CTAATATCTC TCATUVGGGAA ATTCTCAATT ACTGCTTTAA 10920

TTAATGGTGA CAATCAATCA ATAGATATAA GCAAACCAAT CAGACTCATG GAAGGTCAAA 10980

CTCATGCTCA AGCAGATTAT TTGCTAGCAT TAAATAGCCT TAAATTACTG TATAAAGAGT 11040

ATGCAGGCAT AGGCCACAAA TTAAAAGGAA CTGAGACTTA TATATCACGA GATATGCAAT 11100

TTATGAGTAA AACAATTCAA CATAACGGTG TATATTACCC AGCTAGTATA AAGAAAGTCC 11160

TAAGAGTGGG ACCGTGGATA AACACTATAC TTGATGATTT CAAAGTGAGT CTAGAATCTA 11220

TAGGTAGTTT GACACAAGAA TTAGAATATA GAGGTGAAAG TCTATTATGC AGTTTAATAT 11280

TTAGAAATGT ATGGTTATAT AATCAGATTG CTCTACAATT AAAAAATCAT GCATTATGTA 11340

ACAATAAACT ATATTTGGAC ATATTAAAGG TTCTGAAACA CTTAAAAACC •rriTTTAATC 11400

TTGATAATAT TGATACAGCA TTAACATTGT ATATGAATTT ACCCATGTTA TTTGGTGGTG 11460

GTGATCCCAA CTTGTTATAT CGAAGTTTCT ATAGAAGAAC TCCTGACTTC CTCACAGAGG 11520

CTATAGTTCA CTCTGTGTTC ATACTTAGTT ATTATACAAA CCATGACTTA AAAGATAAAC 11580

TTCAAGATCT GTCAGATGAT AGATTGAATA AGTTCTTAAC ATGCATAATC ACGTTTGACA 11640

AAAACCCTAA TGCTGAATTC GTAACATTGA TGAGAGATCC TCAAGCTTTA GGGTCTGAGA 11700

GACAAGCTAA AATTACTAGC GAAATCAATA GACTGGCAGT TACAGAGGTT TTGAGTACAG 11760

CTCCAAACAA AATATTCTCC AAAAGTGCAC AACATTATAC TACTACAGAG ATAGATCTAA 11820

ATGATATTAT GCAAAATATA GAACCTACAT ATCCTCATGG GCTAAGAGTT GTTTATGAAA 11880

GTTTACCCTT TTATAAAGCA GAGAAAATAG TAAATCTTAT ATCAGGTACA AAATCTATAA 11940

CTAACATACT GGAAAAAACT TCTGCCATAG ACTTAAC7VGA TATTGATAGA GCCACTGAGA 12000

TGATGAGGAA AAACATAACT TTGCTTATAA GGATACTTCC ATTGGATTGT AACAGAGATA 12060

AAAGAGAGAT ATTGAGTATG GAAT^CCTAA GTATTACTGA ATTAAGC7\AA TATGTTAGGG 12120

AAAGATCTTG GTCTTTATCC AATATAGTTG GTGTTACATC ACCCAGTATC ATGTATACAA 12180

TGGACATCAA ATATACTACA AGCACTATAT CTAGTGGCAT AATTATAGAG AAATATAATG 12240

TTAACAGTTT AACACGTGGT GAGAGAGGAC CCACTAAACC ATGGGTTGGT TCATCTACAC 12300

AAGAGAAAAA AACAATGCCA GTTTATAATA GACAAGTCTT AACCAAAAAA CAGAGAGATC 12360

AAATAGATCT ATTAGCAAAA TTGGATTGGG TGTATGCATC TATAGATAAC AAGGATGAAT 12420

TCATGGAAGA ACTCAGCATA GGAACCCTTG GGTTAACATA TGAAAAGGCC AAGAAATTAT 12480

TTCCACAATA TTTAAGTGTC AATTATTTGC ATCGCCTTAC AGTCAGTAGT AGACCATGTG 12540



AATTCCCTGC ATCAATACC7V GCTTATAGAA C7UVCAAATTA TCACTTTGAC ACTAGCCCTA 12600

TTAATCGCAT ATTAACAGAA AAGTATGGTG ATGAAGATAT TGACATAGTA TTCCAAAACT 12660

GTATAAGCTT TGGCCTTAGT TTAATGTCAG TAGTAGAACA ATTTACTAAT GTATGTCCTA 12720

ACAGAATTAT TCTCATACCT AAGCTTAATG AGATACATTT GATGAAACCT CCCATATTCA 12780

CAGGTGATGT TGATATTCAC AAGTTAAAAC AAGTGATACA AAAACAGCAT ATGTTTTTAC 12840

CAGACAAAAT AAGTTTGACT CAATATGTGG T^TTATTCTT AAGTAATAAA ACACTCAAAT 12900

CTGGATCTCA TGTTAATTCT AATTTAATAT TGGCACATAA AATATCTGAC TATTTTCATA 12960

ATACTTACAT TTTAAGTACT AATTTAGCTG GACATTGGAT TCTGATTATA CAACTTATGA 13020

AAGATTCTAA AGGTATTTTT GAAAAAGATT GGGGAGAGGG ATATATAACT GATCATATGT 13080

TTATTAATTT GAAAGTTTTC TTCAATGCTT ATAAGACCTA TCTCTTGTGT TTTCATAAAG 13140

GTTATGGCAA AGCAAAGCTG GAGTGTGATA TGAACACTTC AGATCTTCTA TGTGTATTGG 13200

AATTAATAGA CAGTAGTTAT TGGAAGTCTA TGTCTAAGGT ATTTTTAGAA CAAAAAGTTA 13260

TCAAATACAT TCTTAGCCAA GATGCAAGTT TACATAGAGT AAAAGGATGT CATAGCTTCA 13320

AATTATGGTT TCTTAAACGT CTTAATGTAG CAGAATTCAC AGTTTGCCCT TGGGTTGTTA 13380

ACATAGATTA TCATCCAACA CATATGAAAG CAATATTAAC TTATATAGAT CTTGTTAGAA 13440

TGGGATTGAT AAATATAGAT AGAATACACA TTAAAAATAA ACACAAATTC AATGATGAAT 13500

TTTATACTTC TT^TCTCTTC TACATTAATT ATAACTTCTC AGATAATACT CATCTATTAA 13560

CTAAACATAT AAGGATTGCT AATTCTGAAT TAGAAAATAA TTACAACAAA TTATATCATC 13620

CTACACCAGA AACCCTAGAG AATATACTAG CCAATCCGAT TAAAAGTAAT GACAAAAAGA 13680

CACTGAATGA CTATTGTATA GGTAAAAATG TTGACTCAAT AATGTTACCA TTGTTATCTA 13740

ATAAGAAGCT TATTAAATCG TCTGCAATGA TTAGAACCAA TTACAGCAAA CAAGATTTGT 13800

ATAATTTATT CCCTATGGTT GTGATTGATA GAATTATAGA TCATTCAGGC AATACAGCCA 13860

AATCCAACCA ACTTTACACT ACTACTTCCC ACCAAATATC CTTAGTGCAC AATAGCACAT 13920

CACTTTACTG CATGCTTCCT TGGCATCATA TTAATAGATT CAATTTTGTA TTTAGTTCTA 13980

CAGGTTGTAA AATTAGTATA GAGTATATTT TAAAAGATCT TAAAATTAAA GATCCCAATT 14040

GTATAGCATT CATAGGTGAA GGAGCAGGQA ATTTATTATT GCGTACAGTA GTGGAACTTC 14100

ATCCTGACAT AAGATATATT TACAGAAGTC TGAAAGATTG CAATGATCAT AGTTTACCTA 14160

TTGAGTTTTT AAGGCTGTAC AATGGACATA TCAACATTGA TTATGGTGAA AATTTGACCA 14220

TTCCTGCTAC AGATGCAACC AACAACATTC ATTGGTCTTA TTTACATATA AAGTTTGCTG 14280

AACCTATCAG TCTTTTTGTC TGTGATGCCG AATTGTCTGT AACAGTCAAC TGGAGTAAAA 14340

TTATAATAGA ATGQAGCAAG CATGTAAGAA AGTGCAAGTA CTGTTCCTCA GTTAATAAAT 14400

GTATGTTAAT AGTAAAATAT CATGCTCAAG ATGATATTGA TTTCAAATTA GACAATATAA 14460

CTATATTAAA AACTTATGTA TGCTTAGGCA GTAAGTTAAA GGGATCGGAG GTTTACTTAG 14520

TCCTTACAAT AGGTCCTGCG AATATATTCC CAGTATTTAA TGTAGTACAA AATGCTAAAT 14580

TGATACTATC AAGAACCAAA AATTTCATCA TGCCTAAGAA AGCTGATAAA GAGTCTATTG 14640



ATGCAAATAT TAAAAGTTTG ATACCCTTTC TTTGTTACCC TATAACAAAA AAAGGAATTA 14700

ATACTGCATT GTCAAAACTA AAGAGTGTTG TTAGTGGAGA TATACTATCA TATTCTATAG 14760

CTGGACGTAA TGAAGTTTTC AGCAATAAAC TTATAAATCA TAAGCATATG AACATCTTAA 14820

AATGGTTCAA TCATGTTTTA AATTTCAGAT CAACAGAACT AAACTATAAC CATTTATATA 14880

TGGTAGAATC TACATATCCT TACCTAAGTG AATTGTTAAA CAGCTTGACA ACCAATGAAC 14940

TTAAAAAACT GATTAAAATC ACAGGTAGTC TGTTATACAA CTTTCATAAT GAATAATGAA 15000

TAAAGATCTT ATAATAAAAA TTCCCATAGC TATACACTAA CACTGTATTC AATTATAGTT 15060

ATTAAAAATT AAAAATCATA TAATTTTTTA AATAACTTTT AGTGAACTAA TCCTAAAGTT 15120

ATCATTTTAA TCTTGGAGGA ATAAATTTAA ACCCTAATCT AATTGGTTTA TATGTGTATT 15180

AACTAAATTA CGAGATATTA GTTTTTGACA CTTTTTTTCT CGT 15223

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15225 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

ACGCGAAAAA ATGCGTACTA CAAACTTGCA CATTCGGAAA AAATGGGGCA AATAAGAATT 60 

TGATAAGTGC TATTTAAGTC TAACCTTTTC TU^TCAGAAAT GGGGTGCAAT TCACTGAGCA 120 

TGATAAAGGT TAGATTACAA AATTTATTTG ACAATGACGA AGTAGCATTG TTAAAAATAA 180

CATGTTATAC TGACAAATTA ATTCTTCTGA CCAATGCATT AGCCAAAGCA GCAATACATA 240 

CAATTAAATT AAACGGTATA GTTTTTATAC ATGTTATAAC AAGCAGTGAA GTGTGCCCTG 300 

ATAACAACAT TGTAGTAAAA TCTAACTTTA CAACAATGCC AATATTACAA AACGGAGGAT 360 

ACATATGGGA ATTGATTGAG TTGACACACT GCTCTCAATT AAACGGTCTA ATGGATGATA 420 

ATTGTGAAAT CAAATTTTCT AAAAGACTAA GTGACTCAGT AATGACTAAT TATATGAATC 480 

AAATATCTGA TTTACTTGGG CTTGATCTCA ATTCATGAAT TATGTTTAGT CTAACTCAAT 540 

AGACATGTGT TTATTACCAT TTTAGTTAAT ATAAAAACTC ATCAAAGGGA AATGGGGCAA 600 

ATAAACTCAC CTAATCAATC TU^CTATGAG CACTACAAAT GACAACACTA CTATGCAAAG 660 

ATTAATGATC ACGGACATGA GACCCCTGTC GATGGATTCA ATAATAACAT CTCTCACCAA 720 

AGA7ATCATC ACACACAAAT TCATATACTT GATAAACAAT GAATGTATTG TAAGAAAACT 780 

TGATGAAAGA CAAGCTACAT TTACATTCTT AGTCAATTAT GAGATGAAGC TACTGCACAA 840 

AGTAGGGAGT ACCAAATACA AGAAATACAC TGAATATAAT ACAAAATATG GCACTTTCCC 900 

CATGCCTATA TTTATCAATC ATGGCGGGTT TCTAGAATGT ATTGGCATTA AGCCTACAAA 960

ACACACTCCT ATAATATACA AATATGACCT CAACCCGTAA ATTCCAACAA AAAAAACCAA 1020 

CCCAACCAAA CCTUVGCTATT CCTCAAACAA CAATGCTCAA TAGTTAAGAA GGAGCTAATC 1080 




CGTTTTAGTA ATTAAAAATA AAAGTAAAGC CAATAACATA AATTGGGGCA T^TACAAAGA 1140

TGGCTCTTAG CAAAGTCAAG TTAAATGATA CATTAAATAA GGATCAGCTG CTGTCATCCA 1200

GCAAATACAC TATTCAACGT AGTACAGGAG ATAATATTGA CACTCCCAAT TATGATGTGC 1260

AAAAACACCT AAACAAACTA TGTGGTATGC TATTAATCAC TGAAGATGCA AATCATAT^T 1320

TCACAGGATT AATAGGTATG TTATATGCTA TGTCCAGGTT AGGAAGGGAA GACACTATAA 1380

AGATACTTAA AGATGCTGGA TATCATGTTA AAGCTAATGG AGTAGATATA ACAACATATC 1440

GTCAAGATAT AAATGGAAAG GAAATGAAAT TCGAAGTATT AACATTATCA AGCTTGACAT 1500

CAGAAATACA AGTCAATATT GAGATAGAAT CTAGAAAATC CTACAAAAAA ATGCTAAAAG 1560

AGATGGGAGA AGTGGCTCCA GAATATAGGC ATGATTCTCC AGACTGTGGG ATGATAATAC 1620

TGTGTATAGC AGCACTTGTA ATAACCAAAT TAGCAGCAGG AGACAGATCA GGTCTTACAG 1680

CAGTAATTAG GAGGGCAAAC AATGTCTTAA AAAATGAAAT AAAACGCTAC AAGGGTCTCA 1740

TACCAAAGGA TATAGCTAAC AGTTTTTATG AAGTGTTTGA AAAACACCCT CATCTTATAG 1800

ATGTTTTTGT GCACTTTGGC ATTGCACAAT CATCAACAAG AGGGGGTAGT AGAGTTGAAG 1860

GAATCTTTGC AGGATTGTTT ATGAATGCCT ATGGTTCAGG GCAAGTAATG CTAAGATGGG 1920

GAGTTTTAGC CAAATCTGTA AAAAATATCA TGCTAGGTCA TGCTAGTGTC CAGGCAGAAA 1980

TGGAGCAAGT TGTGGAAGTC TATGAGTATG CACAGAAGTT GGGAGGAGAA GCTGGATTCT 2040

ACCATATATT GAACAATCCA AAAGCATCAT TGCTGTCATT AACTCAATTT CCTAACTTCT 2100

CAAGTGTGGT CCTAGGCAAT GCAGCAGGTC TAGGCATAAT GGGAGAGTAT AGAGGTACGC 2160

CAAGAAACGA GGATCTTTAT GATGCAGCCA AAGCATATGC AGAGCAACTC AAAGAAAATG 2220

GAGTAATAAA CTACAGTGTA TTAGACTTAA CAGCAGAAGA ATTGGAAGCC ATAAAGAATC 2280

AACTCAACCC TAAAGAAGAT GATGTAGAGC TTTAAGTTAA CAAAAAATAC GGGGCAAATA 2340

AGTCAACT^TG GAGAAGTTTG CACCTGAATT TCATGGAGAA GATGCAAATA ACAAAGCTAC 2400

CAAATTCCTA GAATCAATAA AGGGCAAGTT CGCATCATCC AAAGATCCTA AGAAGAAAGA 2460

TAGCATAATA TCTGTTAACT CAATAGATAT AGAAGTAACC AAAGAGAGCC CGATAACATC 2520

TGGCACCAAC ATCATCAATC CAACAAGTGA AGCCGACAGT ACCCCAGAAA CCAAAGCCAA 2580

CTACCCAAGA AAACCCCTAG TAAGCTTCT^ AGAAGATCTC ACCCCAAGTG ACAACCCTTT 2640

TTCTAAGTTG TACAAAGAAA CAATAGAAAC ATTTGATAAC AATGAAGAAG AATCTAGCTA 2700

CTCATATGT^ GAGATAAATG ATCAAACAAA TGACAACATT ACAGCAAGAC TAGATAGAAT 2760

TGATGAAAAA TTAAGTGAAA TATTAGGAAT GCTCCATACA TTAGTAGTTG CAAGTGCAGG 2820

ACCCACTTCA GCTCGCGATG GAATAAGAGA TGCTATGGTT GGTCTGAGAG AAGAAATGAT 2880

AOAAAAAATA AGAGCGGAAG CATTAATGAC CAATGATAGG TTAGAGGCTA TGGC7UVGACT 2940

TAGGAATGAG GAAAGCGAAA AAATGGCAAA AGACACCTCA GATGAAGTGC CTCTTAATCC 3000

AACTTCCAAA AAATTGAGTG ACTTGTTGGA AGACAACGAT AGTGACAATG ATCTGTCACT 3060

TGATGATTTT TGATCAGTGA TCAACTCACT CAGCAATCAA CAACATCAAT AAAAC7VGACA 3120

TCAATCCATT GAATCAACTG CCAGACCGAA CAAACAAATG TCCGTCAGCG GAACCACCAA 3180




CCAATCAATC AACCAACTGA TCCATCAGCA ACCTGACGAA ATTAACAATA TAGTAACAAA 3240

AAAAGAACAA GATGGGGCAA ATATGGAAAC ATACGTGAAC AAGCTTCACG AAGGCTCCAC 3300

ATACACAGCA GCTGTTCAGT ACAATGTTCT AGAAAAAGAT GATGATCCTG CATCACTAAC 3360

AATATGGGTG CCTATGTTCC AGTCATCTGT ACCAGCAGAC TTGCTCATAA AAGAACTTGC 3420

AAGCATCAAC ATACTAGTGA AGCAGATCTC TACGCCCAAA GGACCTTCAC TACGAGTCAC 3480

GATTAACTCA AGAAGTGCTG TGCTGGCTCA AATGCCTAGT AATTTCATCA TAAGCGCAAA 3540

TGTATCATTA GATGA7AGAA GCAAATTAGC ATATGATGTA ACTACACCTT GTGAAATCAA 3600

AGCATGCAGT CTAACATGCT TAAAAGTGAA AAGTATGTTA ACTACAGTCA AAGATCTTAC 3660

CATGAAGACA TTCAACCCCA CTCATGAGAT CATTGCTCTA TGTGAATTTG AAAATATTAT 3720

GACATCAAAA AGAGTAATAA TACCAACCTA TCTAAGACCA ATTAGTGTCA AAAACAAGGA 3780

TCTGAACTCA CTAGAAAACA TAGCAACCAC CGAATTCAAA AATGCTATCA CCAATGCGAA 3840

AATTATTCCC TATGCTGGAT TAGTATTAGT TATCACAGTT ACTGACAATA AAGGAGCATT 3900

CAAATATATC AAGCCACAGA GTCAATTTAT AGTAGATCTT GGTGCCTACC TAGAAAAAGA 3960

GAGCATATAT TATGTGACTA CTAATTGGAA GCATACAGCT ACACGTTTTT CAATCAAACC 4020

ACTAGAGGAT TAAATTTAAT TATCAACACT GAATGACAGG TCCACATATA TCCTCAAACT 4080

ACACACTATA TCCAAACATC ATGAACATCT ACACTACACA CTTCATCACA CAAACCAATC 4140

CCACTCAAAA TCCAAAATCA CTACCAGCCA CTATCTGCTA GACCTAGAGT GCGAATAGGT 4200

AAATAAAACC AAAATATGGG GTAAATAGAC ATTAGTTAGA GTTCAATCAA TCTCAACT^C 4260

CATTTATACC GCCAATTCAA TACATATACT ATAAATCTTA AAATGGGAAA TACATCCATC 4320

ACAATAGAAT TCACAAGCAA ATTTTGGCCC TATTTTACAC TAATACATAT GATCTTAACT 4380

CTAATCTCTT TACTAATTAT AATCACTATT ATGATTGCAA TACTAAATAA GCTAAGTGAA 4440

CATAAAACAT TCTGTAACAA TACTCTTGAA CTAGGACAGA TGCATCAAAT CT^CACATAG 4500

TGCTCTACCA TCATGCTGTG TCAAATTATA ATCCTGTATA TATAAACAAA CAAATCCAAT 4560

CTTCTCACAG AGTC7VTGGTG TCGCAAAACC ACGCCAACTA TCATGGTAGC ATAGAGTAGT 4620

TATTTAAAAA TTAACATAAT GATGAATTAT TAGTATGGGA TCAAAAACAA CATTGGGGCA 4680

AATGCAACCA TGTCCAAACA CAAGAATCAA CGCACTGCCA GGACTCTAGA AAAGACCTGG 4740

GATACTCTCA ATCATCTAAT TGTAATATCC TCTTGTTTAT ACAGATTAAA TTTAAAATCT 4800

ATAGCACAAA TAGCACTATC AGTTCTGGCA ATGATAATCT CAACCTCTCT CATAATTGCA 4860

GCCATAATAT TCATCATCTC TGCCAATCAC AAAGTTACAC TAACAACGGT CACAGTTCAA 4920

ACAATAAAAA ACCACACTGA AAAAAACATC ACCACCTACC TTACTCAAGT CCCACCAGAA 4980

AGGGTTAGCT CATCCAAACA ACCTACAACC TCCACACAAA TTCAGCCACA 5040

ACATCACCCA ACACAAAGTC AGAAACAC7VC CACACAACAG CACAAACCAA AGGCAGAACC 5100

ACCACCTCT^ CACAGACCAA CAAGCCGAGC ACAAAACCAC GCCTAAAAAA TCCACCAAAA 5160

AAACCAAAAG ATGATTACCA TTTTGAAGTG TTCAACTTCG TTCCCTGTAG TATATGTGGC 5220

AACAATCAAC TTTGCAAATC CATCTGTAAA ACAATACCAA GCAACA^ACC AAAGAAGAAA 5280




CCAACCATCA AACCCACAAA CAAACCAACC ACCAAAACCA CAAACAAAAG AGACCCAAAA 5340

ACACCAGCCA AAACGACGAA AAAAGAAACT ACCACCAACC CAACAAAAAA ACCAACCCTC 5400

ACGACCACAG AAAGAGACAC CAGCACCTCA CAATCCACTG TGCTCGACAC AACCACATTA 5460

GAACACACAA TCCAACAGCA ATCCCTCCAC TCAACCACCC CCGAAAACAC ACCCAACTCC 5520

ACACAAACAC CCACAGCATC CGAGCCCTCT ACATCAAATT CCACCCAAAA TACCCAATCA 5580

CATGCTTAGT TATTCAAAAA CTACATCTTA GCAGAAAACC GTGACCTATC AAGCAAGAAC 5640

GAAATTAAAC CTGGGGCAAA TAACCATGGA GCTGCTGATC CACAGGTTAA GTGCAATCTT 5700

CCTAACTCTT GCTATTAATG CATTGTACCT CACCTCAAGT CAGAACATAA CTGAGGAGTT 5760

TTACCAATCG ACATGTAGTG CAGTTAGCAG AGGTTATTTT AGTGCTTTAA GAACAGGTTG 5820

GTATACCAGT GTCATAACAA TAGAATTAAG TAATATAAAA GAAACCAAAT GCAATGGAAC 5880

TGACACTAAA GTAAT^CTTA TAAAACAAGA ATTAGATAAG TATAAGAATG CAGTGACAGA 5940

ATTACAGCTA CTTATGCAAA ACACACCAGC TGCCAACAAC CGGGCCAGAA GAGAAGCACC 6000

ACAGTATATG AACTATACAA TCAATACCAC TAAAAACCTA AATGTATCAA TAAGCAAGAA 6060

GAGGAAACGA AGATTTCTGG GCTTCTTGTT AGGTGTAGGA TCTGCAATAG CAAGTGGTAT 6120

AGCTGTATCC AAAGTTCTAC ACCTTGAAGG AGAAGTGAAC AAGATCAAAA ATGCTTTGTT 6180

ATCTACAAAC AAAGCTGTAG TCAGTCTATC AAATGGGGTC AGTGTTTTAA CCAGCAAAGT 6240

GTTAGATCTC AAGAATTACA TAAATAACCA ATTATTACCC ATAGTAAATC AACAGAGCTG 6300

TCGCATCTCC AACATTGAAA CAGTTATAGA ATTCCAGCAG AAGAACAGCA GATTGTTGGA 6360

AATCAACAGA GAATTCAGTG TCAATGCAGG TGTAACAACA CCTTTAAGCA CTTACATGTT 6420

AACAAACAGT GAGTTACTAT CATTGATCAA TGATATGCCT ATAACAAATG ATCAGAAAAA 6480

ATTAATGTCA AGCAATGTTC AGATAGTAAG GCAACAAAGT TATTCTATCA TGTCTATAAT 6540

AAAGGAAGAA GTCCTTGCAT ATGTTGTACA GCTACCTATC TATGGTGTAA TAGATACACC 6600

TTGCTGGAAA TTACACACAT CACCTCTATG CACCACCAAC ATCAAAGAAG GATCAAATAT 6660

TTGTTTAACA AGGACTGATA GAGGATGGTA TTGTGATAAT GCAGGATCAG TATCCTTCTT 6720

TCCACAGGCT GACACTTGTA AAGTACAGTC CAATCGAGTA TTTTGTGACA CTATGAACAG 6780

TTTGACATTA CCAAGTGAAG TCAGCCTTTG TAACACTGAC ATATTCAATT CCAAGTATGA 6840

CTGCAAAATT ATGACATCT^ AAACAGACAT AAGCAGCTCA GTAATTACTT CTCTTGGAGC 6900

TATAGTGTCA TGCTATGGTA ATJ^CTAAATG CACTGCATCC AACAAAAATC GTGGGATTAT 6960

AAAGACATTT TCTAATGGTT GTGACTATGT GTCAAACAAA GGAGTAGATA CTGTGTCAGT 7020

GGGCT^CACT TTATACTATG TAAACAAGCT GGAAGGCAAG AACCTTTATG TAAAAGGGGA 7080

ACCTATAATA AATTACTATG ACCCTCTAGT OAT(^GTTTG V3X% X XXX V3 ATGCATCAAT 7140

ATCTCAAGTC AATGAAAAAA TCAATCAAAG TTTAGCTTTT ATTCGTAGAT CTGATGAATT 7200

ACTACATAAT GTAAATACTG GCAAATCTAC TACAAATATT ATGATAACTA CAATTATTAT 7260

AGTAATCATT GTAGTATTGT TATCATTAAT AGCTATTGGT TTGCTGTTGT ATTGCAAAGC 7320

CAAAAACACA CCAGTTACAC TAAGCAAAGA CCAACTAAGT GGAATCAATA ATATTGCATT 7380



CAGCAAATAG ACAAAAAACC ACCTGATCAT GTTTCAACAA CAGTCTGCTG ATCACCAATC 7440

CCAAATCAAC CCATAACAAA CACTTCAACA TCACAGTACA GGCTGAATCA TTTCTTCACA 7500

TCATGCTACC CACACAACTA AGCTAGATCC TTAACTCATA GTTACATAAA AACCTCAAGT 7560

ATCACAATCA AACACTAAAT CAACACATCA TTCACAAAAT TAACAGCTGG GGCAAATATG 7620

TCGCGAAGAA ATCCTTGTAA ATTTGAGATT AGAGGTCATT GCTTGAATGG TAGAAGATGT 7680

CACTACAGTC ATAATTACTT TGAATGGCCT CCTCATGCCT TACTAGTGAG GCAAAACTTC 7740

ATGTTAAACA AGATACTCAA GTCAATGGAC AAAAGCATAG ACACTTTGTC TGAAATAAGT 7800

GGAGCTGCTG AACTGGACAG AACAGAAGAA TATGCTCTTG GTATAGTTGG AGTGCTAGAG 7860

AGTTACATAG GATCTATAAA CAACATAACA AAACAATCAG CATGTGTTGC TATGAGTAAA 7920

CTTCTTATTG AGATCAATAG TGATGACATT AAAAAGCTGA GAGATAATGA AGAACCCAAT 7980

TCACCTAAGA TAAGAGTGTA CAATACTGTT ATATCATACA TTGAGAGCAA TAGAAAAAAC 8040

AACAAGCAAA CAATCCATCT GCTCAAAAGA CTACCAGCAG ACGTGCTGAA GAAGACAATA 8100

AAAAACACAT TAGATATCCA CAAAAGCATA ATCATAAGCA ACCCAAAAGA GTCAACCGTG 8160

AATGATCAAA ATGACCAAAC CAAAAATAAT GATATTACCG GATAAATATC CTTGTAGTAT 8220

ATCATCCATA TTGATTTCAA GTGAAAGCAT GATTGCTACA TTCAATCATA AAAACATATT 8280

ACAATTTAAC CATAACCATT TGGATAACCA CCAGCGTTTA TTAAATAATA TATTTGATGA 8340

AATTCATTGG ACACCTAAAA ACTTATTAGA TGCCACTCAA CAATTTCTCC AACATCTTAA 8400

CATCCCTGAA GATATATATA CAATATATAT ATTAGTGTCA TAATGCTTGG CCATAACGAT 8460

TCTATATCAT CCAACCATAA AACTATCTTA ATAAGGTTAT GGGACAAAAT GGATCCCATT 8520

ATTAATGGAA ACTCTGCTAA TGTGTATCTA ACTGATAGTT ATTTAAAAGG TGTTATCTCT 8580

TTTTCAGAAT GTAATGCTTT AGGGAGTTAC CTTTTTAACG GCCCTTATCT CAAAAATGAT 8640

TACACCAACT TAATTAGTAG ACAAAGTCCA CTACTAGAGC ATATGAATCT TAAAAAACTA 8700

ACTATAACAC AGTCATTAAT ATCTAGATAT CATAAAGGTG AACTGAAATT AGAAGAACCA 8760

ACTTATTTCC AGTCATTACT TATGACATAT AAAAGCATGT CCTCGTCTGA ACAAATTGCT 8820

ACAACTAACT TACTTAAAAA AATAATACGA AGAGCTATAG AAATAAGTGA TGTAAAGGTG 8880

TACGCCATCT TGAATAAACT AGGACTAAAG GAAAAGGACA GAGTTAAGCC CAACAATAAT 8940

TCAGGTGATG AAAACTCAGT ACTTACAACT ATAATTAAAG ATGATATACT TTCGGCTGTG 9000

GAAAGCAATC AATCATATAC AAATTCAGAC AAAAATCACT CAGTAAATCA AAATATCACT 9060

ATCAAAACAA CACTCTTGAA AAAATTGATG TGTTCAATGC AACATCCTCC ATCATGGTTA 9120

ATACACTGGT TCAATTTATA TACAAAATTA AATAACATAT TAACACAATA TCGATCAAAT 9180

GTCATGGGTT TATATTAATA GATAATCAAA CTTTAAGTGG TTTTCAGTTT 9240

ATTTTAAATC AATATGGTTG TATCGTTTAT CATAAAGGAC TCAT^AAAAAT CACAACTACT 9300

ACTTACAATC AATTTTTAAC ATGGAAAGAC ATCAGCCTTA GCAGATTAAA TGTTTGCTTA 9360

ATTACTTGGA TAAGTAATTG TTTGAATACA TTAAATAAAA GCTTAGGGCT GAGATGTGGA 9420

TTCAATAATG TTGTGTTATC ACAATTATTT CTTTATGGAG ATTGTATACT GAAATTATTT 9480



CATAATGAAG GCTTCTACAT AATAAT^GAA GTAGAGGGAT TTATTATGTC TTTAATTCTA 9540

AACATAACAG AAGAAGATCA ATTTAGGAAA CGATTTTATA ATAGCATGCT A7ATAACATC 9600

ACAGATGCAG CTATTAAGGC TCAAAAGAAC CTACTATCAA GGGTATGTCA CACTTTATTA 9660

GACAAGACAG TGTCTGATAA TATCATAAAT GGTAAATGGA TAATCCTATT AAGTAAATTT 9720

CTTAAATTGA TTAAGCTTGC AGGTGATAAT AATCTCAATA ATTTGAGTGA GCTATATTTT 9780

CTCTTCAGAA TCTTTGGACA TCCAATGGTT GATGAAAGAC AAGCAATGGA TGCTGTAAGA 9840

ATTAACTGTA ATGAAACTAA GTTCTACTTA TTAAGTAGTC TAAGTACGTT AAGAGGTGCT 9900

TTCATTTATA GAATCATAAA AGGGTTTGTA AATACCTACA ACAGATGGCC CACTTTAAGG 9960

AATGCTATTG TCCTACCTCT AAGATGGTTA AACTATTATA AACTTAATAC TTATCCATCT 10020

CTACTTGAAA TCACAGAAAA TGATTTGATT ATTTTATCAG GATTGCGGTT CTATCGTGAA 10080

TTTCATCTGC CTAAAAAAGT GGATCTTGAA ATGATAATAA ATGACAAAGC CATTTCACCT 10140

CCAAAAGATC TAATATGGAC TAGTTTTCCT AGAAATTACA TGCCATCACA TATACAAAAT 10200

TATATAGAAC ATGAAAAGTT GAAGTTCTCT GAAAGCGACA GATCAAGAAG AGTACTAGAG 10260

TATTACTTGA GAGATAATAA ATTCAATGAA TGCGATCTAT ACAATTGTGT AGTCAATCAA 10320

AGCTATCTCA ACAACTCTAA TCACGTGGTA TCACTAACTG GTAAAGAAAG AGAGCTCAGT 10380

GTAGGTAGAA TGTTTGCTAT GCAACCAGGT ATGTTTAGGC AAATCCAAAT CTTAGCAGAG 10440

AAAATGATAG CCGAAAATAT TTTACAATTC TTCCCTGAGA GTTTGACAAG ATATGGTGAT 10500

CTAGAGCTTC AAAAGATATT AGAATTAAAA GCAGGAATAA GCAACAAGTC AAATCGTTAT 10560

AATGATAACT ACAACAATTA TATCAGTAAA TGTTCTATCA TTACAGATCT TAGCAAATTC 10620

AATCAAGCAT TTAGATATGA AACATCATGT ATCTGCAGTG ATGTATTAGA TGAACTGCAT 10680

GGAGTACAAT CTCTGTTCTC TTGGTTGCAT TTAACAATAC CTCTTGTCAC AATAATATGT 10740

ACATATAGAC ATGCACCTCC TTTCATAAAG GATCATGTTG TTAATCTTAA TGAAGTTGAT 10800

GAACAAAGTG GATTATACAG ATATCATATG GGTGGTATTG AGGGCTGGTG TCAAAAACTG 10860

TGGACCATTG AAGCTATATC ATTATTAGAT CTAATATCTC TCAAAGGGAA ATTCTCTATC 10920

ACAGCTCTGA TAAATGGTGA TAATCAGTCA ATTGATATAA GTAAACCAGT TAGACTTATA 10980

GAGGGTCAGA CCCATGCTCA AGCAGATTAT TTGTTAGCAT TAAATAGCCT TAAATTGCTA 11040

TATT^AAGAGT ATGCAGGTAT AGGCCATAAG CTTAAGGGAA CAGAGACCTA TATATCCCGA 11100

GATATGCAGT TCATGAGCAA AACAATCCAG CACAATGGAG TGTACTATCC AGCCAGTATC 11160

AAAAAAGTCC TGAGAGTAGG TCCATGGATA AATACAATAC TTGATGATTT TAAAGTTAGT 11220

TTAGAATCTA TAGGTAGCTT AACACAGGAG TTAGAATACA GAGGGGAAAG CTTATTATGC 11280

AGTTTAATAT TTAGGAACAT TTGGTTATAC AATCAAATTG CTTTGCAACT CCGAAATCAT 11340

GCATTATGTA ACAATAAGCT ATATTTAGAT ATATTGAAAG TATTAAAACA CTTAT^AAACT 11400

TTTTTTAATC TTGATAGTAT CGATATGGCG TTATCATTGT ATATGAATTT GCCTATGCTG 11460

TTTGGTGGTG GTGATCCTAA TTTGTTATAT CGAAGCTTTT ATAGGAGAAC TCCAGACTTC 11520

CTTACAGAAG CTATAGTACA TTCAGTGTTT GTGTTGAGCT ATTATACTGG TCACGATTTA 11580



CAAGATAAGC TCCAGGATCT TCCAGATGAT AGACTGAACA AATTCTTGAC ATGTGTCATC 11640

ACATTCGATA AAAATCCCAA TGCCGAGTTT GTAACATTGA TGAGGGATCC ACAGGCGTTA 11700

GGGTCTGAAA GGCAAGCTAA AATTACTAGT GAGATTAATA GATTAGCAGT AACAGAAGTC 11760

TTAAGTATAG CTCCAAACAA AATATTTTCT AAAAGTGCAC AACATTATAC TACCACTGAG 11820

ATTGATCTAA ATGACATTAT GCAAAATATA GAACCAACTT ACCCTCATGG ATTAAGAGTT 11880

GTTTATGAAA GTCTACCTTT TTATAAAGCA GAAAAAATAG TTAATCTTAT ATCAGGAACA 11940

AAATCCATAA CTAATATACT TGAAAAAACA TCAGCAATAG ATACAACTGA TATTAATAGG 12000

GCTACTGATA TGATGAGGAA AAATATAACT TTACTTATAA GGATACTTCC ACTAGATTGT 12060

AACAAAGACA AAAGAGAGTT ATTAAGTTTA GAAAATCTTA GTATAACTGA ATTAAGCAAG 12120

TATGTAAGAG AAAGATCTTG GTCATTATCC AATATAGTAG GAGTAACATC GCCAAGTATT 12180

ATGTTCACAA TGGACATTAA ATATACAACT AGCACTATAG CCAGTGGTAT AATTATAGAA 12240

AAATATAATG TTAATAGTTT AACTCGTGGT GAAAGAGGAC CTACTAAGCC ATGGGTAGGT 12300

TCATCTACGC AGGAGAAAAA AACAATGCCA GTGTACAATA GACAAGTTTT AACCAAAAAG 12360

CAAAGAGACC AAATAGATTT ATTAGCAAAA TTAGACTGGG TATATGCATC CATAGACAAC 12420

AAAGATGAAT TCATGGAAGA ACTGAGTACT GGAACACTTG GACTGTCATA TGAAAAAGCC 12480

AAAAAGTTGT TTCCACAATA TCTAAGTGTC AATTATTTAC ACCGTTTAAC AGTCAGTAGT 12540

AGACCATGTG AATTCCCTGC ATCAATACCA GCTTATAGAA CAACAAATTA TCATTTCGAT 12600

ACTAGTCCTA TCAATCATGT ATTAACAGAA AAGTATGGAG ATGAAGATAT CGACATTGTG 12660

TTTCAAAATT GCATAAGTTT TGGTCTTAGC CTGATGTCGG TTGTGGAACA ATTCACAAAC 12720

ATATGTCCTA ATAGAATTAT TCTCATACCG AAGCTGAATG AGATACATTT GATGAAACCT 12780

CCTATATTTA CAGGAGATGT TGATATCATC AAGTTGAAGC AAGTGATACA AAAACAGCAT 12840

ATGTTCCTAC CAGATAAAAT AAGTTTAACC CAATATGTAG AATTATTCCT AAGTAACAAA 12900

GCACTTAAAT CTGGATCTAA CATCAATTCT AATTTAATAT TAGTACATAA AATGTCTGAT 12960

TATTTTCATA ATGCTTATAT TTTAAGTACT AATTTAGCTG GACATTGGAT TCTAATTATT 13020

CAACTTATGA AAGATTCAAA AGGTATTTTT GAAAAAGATT GGGGAGAGGG GTACATAACT 13080

GATCATATGT TCATTAATTT GAATGTTTTC TTTAATGCTT ATAAGACTTA TTTGCTATGT 13140

TTTCATAAAG GTTATGGTAA AGCAAAATTA GAATGTGATA TGAACACTTC AGATCTTCTT 13200

TGTGTTTTGG AGTTAATAGA CAGTAGCTAC TGGAAATCTA TGTCTAAAGT TTTCCTAGAA 13260

CAAAAAGTCA TAAAATACAT AGTCAATCAA GACACAAGTT TGCATAGAAT AAAAGGCTGT 13320

CACAGTTTTA AGTTGTGGTT TTTAAAACGC CTTAATAATG CTAAATTTAC CGTATGCCCT 13380

TGGGTTGTTA ACATAGATTA TCACCCAACA CATATGAAAG CTATATTATC TTACATAGAT 13440

TTAGTTAGAA TGGGGTTAAT AAATGTAGAT AAATTAACCA TTAAAAATAA AAACAAATTC 13500

AATGATGAAT TTTACACATC AAATCTCTTT TACATTAGTT ATAACTTTTC AGACAACACT 13560

CATTTGCTAA CAAAACAAAT AAGAATTGCT AATTCAGAAT TAGAAGATAA TTATAACAAA 13620

CTATATCACC CAACCCCAGA AACTTTAGAA AATATATCAT TAATTCCTGT TAAAAGTAAT 13680




AATAGTAACA AACCTAAATT TTGTATAAGT GGAAATACCG AATCTATAAT GATGTCAACA 13740 

TTCTCTAATA AAATGCATAT TAAATCTTCC ACTGTTACCA CAAGATTCAA TTATAGCAAA 13800 

CAAGACTTGT ACAATTTATT TCCAAATGTT GTGATAGACA GGATTATAGA TCATTCAGGT 13860

AATACAGCAA AATCTAACCA ACTTTACATC ACCACTTCAC ATCAGACATC TTTAGTAAGG 13920 

AATAGTGCAT CACTTTATTG CATGCTTCCT TGGCATCATG TCAATAGATT TAACTTTGTA 13980 

TTTAGTTCCA CAGGATGCAA GATCAGTATA GAGTATATTT TAAAAGATCT TAAGATTAAG 14040 

GACCCCAGTT GTATAGCATT CATAGGTGAA GGAGCTGGTA ACTTATTATT ACGTACGGTA 14100 

GTAGAACTTC ATCCAGACAT AAGATACATT TACAGAAGTT TAAAAGATTG CAATGATCAT 14160 

AGTTTACCTA TTGAATTTCT AAGATTATAC AACGGGCATA TAAACATAGA TTATGGTGAG 14220 

AATTTAACCA TTCCTGCTAC AGATGCAACT AATAACATTC ATTGGTCTTA TTTACATATA 14280 

AAATTTGCAG AACCTATTAG CATCTTTGTC TGCGATGCTG AATTACCTGT TACAGCCAAT 14340 

TGGAGTT^AAA TTATAATTGA ATGGAGTAAG CATGTAAGAA AGTGCAAGTA CTGTTCTTCT 14400 

GTAAATAGAT GCATTTTAAT CGCAAAATAT CATGCTCAAG ATGATATTGA TTTCAAATTA 14460

GATAACATTA CTATATTAAA AACTTACGTG TGCCTAGGTA GCAAGTTAAA AGGATCTGAA 14520

GTTTACTTAG TCCTTACAAT AGGCCCTGCA AATATACTTC CTGTTTTTGA TGTTGTGCAA 14580 

AATGCTAAAT TGATTTTTTC AAGAACTAAA AATTTCATTA TGCCTAAAAA AACTGACAAG 14640 

GAATCTATCG ATGCAAATAT TAAAAGCTTA ATACCTTTCC TTTGTTACCC TATAACAAAA 14700

AAAGGAATTA AGACTTCATT GTCAAAATTG AAGAGTGTAG TTAATGGGGA TATATTATCA 14760 

TATTCTATAG CTGGACGTAA TGAAGTATTC AGCAACAAGC TTATAAACCA CAAGCATATG 14820 

AATATCCTAA AATGGCTAGA TCATGTTTTA AATTTTAGAT CAGCTGAACT TAATTACAAT 14880 

CATTTATACA TGATAGAGTC CACATATCCT TACTTAAGTG AATTGTTAAA TAGTTTAACA 14940 

ACCAATGAGC TCAAGAAACT GATTAAAATA ACAGGTAGTG TACTATACAA CCTTCCCAAC 15000 

GAACAGTAAC TTAAAATATC ATTAACAAGT TTGGTCAAAT TTAGATGCTA ACACATCATT 15060 

ATATTATAGT TATTAAAAAA TATGCAAACT TTTCAATAAT TTAGCTTACT GATTCCAAAA 15120

TTATCATTTT ATTTTTAAGG GGTTGAATAA AAGTCTAAAA CTAACAATGA TACATGTGCA 15180 

TTTACAACAC AACGAGACAT TAGTTTTTGA CACTTTTTTT CTCGT 15225 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
ACTCAAATAA GTTAATAAAA AATATCCCGG GAT 33

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
CCCGGGATAT TTTTTATTAA CTTATTTGAG T 31
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GAAAGTATAT ATTATGTT 18
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
TATATAAGCA CGATGATATG 20
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
ACTCAAATAA GTTAAT 16



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
TAACTTATTT GAGT 14

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
GACACAACCC ACAATGATAA TACACCAC 28 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CATCTCTAAC CAAGGGAGTT AAATTTAAGT GG 32 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTAAGGAGAG ATATAAGATA GAAGATG 27 



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GTTTTATATT AACTAATGGT GTTAGTG 27
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
TTATAATTGC AGCCATCATA TTCATAGCCT CGG 33
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
GTGAAGTTGA GATTACAATT GCCAGAATGG 30



